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Abstract. This paper presents a new approach for filter design based on 
stochastic distances and tests between distributions. A window is defined 
around each pixel, samples are compared and only those which pass a 
goodness-of-fit test are used to compute the filtered value. The technique 
is applied to intensity Synthetic Aperture Radar (SAR) data, using the 
Gamma model with varying number of looks allowing, thus, changes in 
heterogeneity. Modified Nagao-Matsuyama windows are used to define 
the samples. The proposal is compared with the Lee's filter which is 
considered a standard, using a protocol based on simulation. Among the 
criteria used to quantify the quality of filters, we employ the equivalent 
number of looks (related to the signal-to- noise ratio), line contrast, and 
edge preservation. Moreover, we also assessed the filters by the Universal 
Image Quality Index and the Pearson's correlation between edges. 
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1 Introduction 

SAR plays an important role in Remote Sensing since they provide complementary 
information to that provided by optical sensors. SAR data are subjected to speckle 
noise, which is also present in laser, ultrasound-B, and sonar imagery [4 j. This 
noise degrades the SAR information content and makes image interpretation 
classification difficult [5 . 

Statistical analysis is essential for dealing with speckle. In addition, statistical 
modeling provides support for the development of algorithms for interpreting the 
data efficiently, and for the simulation of plausible images. Different statistical 
distributions are proposed in the literature to describe speckled data. We used 
the multiplicative model in intensity format for homogeneous areas, ergo the 
Gamma distribution was employed to describe the data [3]. 

This work presents new filters based on stochastic distances and tests between 
distributions, as presented in Nascimento et al. [8]. The filters are compared to 
Lee's filter using a protocol proposed by Moschetti et al. [6 using Monte Carlo 
simulation. The criteria used to evaluate this filters are the equivalent number of 
looks, line contrast, edge preserving, the Q index [10] and Pearson's correlation 
between edges. 
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The paper is organized as follows: In Section [2] we summarise the model for 
speckle data. Section [3] describe the new filters. Section [4] presents the measures 
for assessing the quality of filtered images, with conclusions drawn in Section [5] 



2 The Multiplicative Model 

According to Goodman [4] , the multiplicative model can be used to describe S AR 
data. This model asserts that the intensity observed in each pixel is the outcome of 
the random variable Z\ Q — >• R + which is the product of two independent random 
variables: X: Q — >• R + , that characterizes the backscatter; and Y: Q — »> R+, which 
models the speckle noise. The distribution of the observed intensity Z — XY is 
completely specified by the distributions of X and Y. 

This proposal deals with homogeneous regions in intensity images, so a con- 
stant X ~ A > defines the backscatter, and the speckle noise is described by a 
Gamma distribution Y ~ r(L, L) with unitary mean IE(Y) = 1, where L > 1 is 
number of looks. Thus, it follows that Z ~ r{L,L/X) with density 

/,(^,A) = ^^exp{^}, (1) 

where r is the gamma function, z, A > and L > 1. 



3 Stochastic Distances Filter 



The proposed filter is based on stochastic distances and tests between distribu- 
tions [8], obtained from the class of (/i, 0) -divergences. It employs neighborhoods 
as defined by Nagao and Matsuyama [7 , presented in Figure l(a)[ and extended 
versions as shown in Figure l(b)| 



H 



(a) 5 x 5 neighborhood (b) 7 x 7 neighborhood 

Fig. 1. Nagao-Matsuyama neighbourhoods. 
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Each filtered pixel has a 5 x 5 neighborhood (see Figure l(a)| ) or a 7 x 7 



neighborhood (see Figure [T(b)| ) , within which nine disjoint areas are denned. 
Denote 0\ the estimated parameter in the central 3 x 3 or 5 x 5 neighborhood, 
respectively, and (02 > • • • ? 09 ) the estimated parameters in the eight remaining 
areas. To^account for possible departures from the homogeneous model, we 
estimate Oi = (L^A^) by maximum likelihood; reduced equivalent number of 
looks are related to heterogeneous areas pQ. 

Based on a random sample of size n, z = (zi, £2, • • • , z n ), the likelihood 
function related to the r{L,L/X) distribution is given by 

/ t L \ n n j- 

Thus, maximum likelihood estimators for (L,A), namely, (L, A), are the solution 
of the following system of non-linear equations: 

f InL - - In (i E"=i + £ E"=i ^ *j = 0, 



nL _i_ _£ 
A A 



nL j_ L s~^n ^ 

Z^ 7 = l Z j — U > 



where ip° is the digamma function. 

The proposal is based on the use of stochastic distances on small areas 
within the filtering window. Consider Z\ and Zi random variables defined on 
the same probability space, whose distributions are characterized by the den- 
sities and fzi(zi]0i), respectively, where 0\ and Oi are parameters. 
Assuming that the densities have the same support IcE, the h-<j) divergence 
between fz x and fzi is given by 

D$(Z 1 ,Z i ) = h( [ ^(^4^It) fztfaOJdx), (3) 

where h: (0, oo) — » [0, oo) is a strictly increasing function with h(0) = and 
h'[x) > for every x G R, and 0: (0, oo) — » [0, oo) is a convex function [9]. 
Choices of functions h and result in several divergences. 

Divergences sometimes are not distances because they are not symmetric. 
A simple solution, described in [8], is to define a new measure d!^ given by 
d^(Z u Zi) = (D^Z^Zi) + D^(Z i ,Z 1 ))/2. Distances, in turn, can be conve- 
niently scaled to present good statistical properties that make them statistical 
hypothesis tests [8]: 

^0i,£) = ^4(0iA), (4) 

where 0i and Oi are maximum likelihood estimators based on samples size m 
and n, respectively, and k = (//(O)^") -1 . When 0\ = Oi, under mild conditions 
S^(0i, 0i) is asymptotically Xm distributed, being M the dimension of 0\. Ob- 
serving Sh{0\, Oi) = s, the null hypothesis Oi = Oi can be rejected at level 77 if 
P r (xif > s) < r]. Details can be seen in the work by Salicru et al. [9]. 
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Since we are using the same sample for eight tests, we modified the value 
of r\ by a Bonferroni-like correction, namely, Sidak correction, that is given by 
77 = 1 — (1 — a) 1 /*, where t is the number of tests and, a the level of significance 
for the whole series of tests. 

Nascimento et al. S\ derived several distances for the Q° model, which includes 
the one presented in Equation 0. We opted for the latter, due to the numerical 
complexity of the former; the lack of flexibility is alleviated by allowing the 
number of looks to vary locally. The statistical tests used in this paper are then: 

Hellinger test: S H = ^ (l 2L &^ L/2 



m + n y (Ai+A;) L 

Kulback-Leibler test: S KL = ^ ^l^f 

Renyi test of order /3: = ^ 0R( Z R n log ^ ~ A w~ ry, in 

J V R m+n 2/3(/3-l) & (fi. + ^pfij (^ + (1-/3)3^) ' 

which < ft < 1. 

Although these are all different tests, in practice they led to exactly the same 
decisions in all situations here considered. We, therefore, chose to work only with 
the test based on the Hellinger distance since it has the smallest computational 
cost in terms of number of operations. 

The filtering procedure consists in checking which regions can be considered 
as coming from the same distribution that produced the data in the central block. 
The sets which are not rejected are used to compute a local mean. If all the 
sets are rejected, the filtered value is updated with the average on the central 
neighborhood around the filtered pixel. 



4 Results 



Image quality assessment in general, and filter performance evaluation in partic- 
ular, are hard tasks 6 . A "good" technique must combat noise and, at the same 
time, preserve details as well as relevant information. In the following we assess 
the filters by two approaches. Firstly, we use simulated data; with this, we are 
able to compare the true image (phantom) with the result of applying filters to 
corrupted version of the phantom. Secondly, we apply measures of quality to a 
real image and its filtered version. 



4.1 Simulated data 

The Monte Carlo experiment discussed in Moschetti et al. [6] consists of sim- 
ulating corrupted images with different parameters. Each simulated image is 
subjected to filters, and quality measures are computed from each result. The 
quality of the filter with respect to each measure can then be assessed analyzing 
the data, not just a single image. We use a phantom image (see Figure 2(a) ) 
which consists of light strips and points on a dark background, and we corrupt it 
with speckle noise (see Figure 2(b)). The following measures of quality on the 
filtered versions as, for instance, Figures 2(c) and |2(d)[ are then computed: 
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Equivalent number of looks: in intensity imagery and homogeneous areas, it 
can be estimated by NEL = (z/dz) 2 , i-e., the square of the reciprocal of the 
coefficient of variation. In this case, the bigger the better. 

Line contrast: the preservation of the line of one pixel width will be assessed by 
computing three means: in the coordinates of the original line {xz) and in two 
lines around it [xg x and X£ 2 ). The contrast is then defined as 2xg — [xg x +x^ 2 ), 
and compared with the contrast in the phantom. The best values are the 
smallest. 

Edge preserving: it is measured by means of the edge gradient (the absolute 
difference of the means of strip around edges) and variance (same as the 
former but using variances instead of means). The best values are the smallest. 

The Q index: Q = -2 T-2 2 2^ \ , where si and • denote the sample variance 

s x Sy x -\-y ^x'^y 

and mean, respectively. The range of Q is [—1,1], being 1 the best value. 

The B n index: B n = J .„„I J . ... it is a correlation measure is 

H H vEj=iW-^) 2 Ej=i(2/j-J/) 2 

between the Laplacians of images X and Y, where •j and • denote the 
gradient values of the jth pixel and mean of the images V 2 X and V 2 F, 
respectively. The range of f3 p is [—1,1], being 1 the perfect correlation. 



(a) Phantom (b) Corrupted (c) Lee's filter (d) Hellinger filter 

Fig. 2. Lee's Protocol, speckled data and filtered images. 



The proposed filter was compared with Lee's filter [5] which is considered 
a standard. All filters were applied only once for the whole series of tests. The 
results obtained are summarized by means of boxplots. Each boxplot describes the 
results of one filter applied to 100 images formed by independent samples of the 
r(L,L/X) distribution with the parameters shown in Table [l] These parameters 
describes situations usually found when analyzing SAR imagery in homogeneous 
regions. The tests were performed at the 80%, 90% and 99% level of significance. 

Figure [3] shows the boxplot of six measures corresponding to four filters. 
Vertical axes are coded by the filter ('L' for Lee and 'H' for Hellinger), the 
situation ID (from 1 to 4, see Table [T]), the filter size (5x5 and 7x7). Only 
results at the 99% level of significance are shown; the rest is consistent with this 
discussion. 

Lee's filter presents better results than our proposal with respect to Edge 



Variance and the /3 P index in most of the considered situations, c.f. figures 3(d) 
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Table 1. Simulated situations with the r(L,L/X) distribution. 



Situation ID 


L 


A 


Background mean 


#1 


1 


200 


70 


#2 


3 


195 


55 


#3 


5 


150 


30 


#4 


7 


170 


35 



and |3(f)| The filters based on stochastic distances consistently outperform Lee's 



filter with respect to all the other criteria, namely Number of Looks (figure 3(a) ), 
Line Contrast (figure 3(b)), Edge Gradient (figure [3(c)| , and the Universal Index 



Quality measure (figure 1 3 (e)| . 



4.2 Real data 



Not all the quality measures presented in Section |4.1| can be applied to real data, 
unless the ground truth is known. For this reason, the following metrics will be 
used in this case [2 j, where the smaller is the better (they are all error measures): 

Mean Absolute Error: MAE = n" 1 Y!j=i \ x j ~ VjV 
Mean Square Error: MSE = n" 1 YTj=i( x j ~ Vj? • 
Normalized Mean Square Error: NMSE = E "=^* 3 'T )2 • 

^j = l X 3 

Distortion Contrast: DCON = n _1 Y^j=i J+^.+J. 7 wnere a depends on the 
relationship between luminance and gray level of the display; we used a = 
23/255. 

Figure [4] presents the real image, its filtered versions and analysis 1-D of the 
row = 50. The original data were produced by the E-SAR sensor in the L band 
(HH polarization) with 2.2 x 3.0 m of ground resolution and four nominal looks. 
Nascimento et al. [8] analyzed this image, and the equivalent number of looks in 
homogeneous areas is always below three. 

The Lee filtered image is smoother that the ones obtained with stochastic 
distances, but comparing figures |4(b)| and |4(c)| one notices that our proposal 



retains much more detail than the classical technique. Figure 4(d) presents the 
profile of the images in the highlighted line. While combating speckle with almost 
the same success: the bright spots in the upper right corner, which are seldom 
visible after applying the Lee filter, stand out in the image filtered with the 
Hellinger distance and windows of side 5 at a level a = 80%. 

Table [2] presents the assessment of the filters, and we note that the Hellinger 
filter of order 5 with a = 80% achieved the best results. 



5 Conclusions 

We presented new filters based on stochastic distances for speckle noise reduction. 
The proposal was compared with the classical Lee filter, using a protocol based 
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(a) Equivalent Number of Looks (b) Line Contrast 




(c) Edge Gradient (d) Edge Variance 



L2w5 - 
H1w7 99 - 
H1w5 99 - 



0.12 0.14 0.16 0.18 0.20 0.22 0.24 



(e) Values of Q 



(f) Values of , 



Fig. 3. Boxplots of measures applied to four niters in four situations. 



on Monte Carlo experiences, showing that it is competitive. An applications to 
real SAR data was presented and, numerical methods were used to assert the 
proposal. The proposed niters behave nearly alike, and they outperform the Lee 
filter in almost all quality measures. However, other significance levels will be 
tested, along with different points of parameter space to have a yet more complete 
assessment of proposal. The proposal can be extended to any problem, requiring 
only the computation of stochastic distances. 
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0.250 


0.189 


99% 


H_w5 


0.127 


0.029 


0.085 


0.166 


0.397 


0.180 




H_w7 


0.145 


0.036 


0.109 


0.185 


0.222 


0.189 



Geoscience and Remote Sensing 47(11), 3795-3809 (2009) 

2. Baxter, R., Seibert, M.: Synthetic aperture radar image coding. MIT Lincoln 
Laboratory Journal 11(2), 121-158 (1998) 

3. Gao, G.: Statistical modeling of SAR images: A Survey. Sensors 10, 775-795 (Jan 
2010) 

4. Goodman, J.W.: Some fundamental properties of speckle. Journal of the Optical 
Society of America 66(11), 1145-1150 (1976) 

5. Lee, J.S.: Speckle suppression and analysis for synthetic aperture radar images. 
Optical Engineering 25(5), 636-645 (1986), iSSN 0091-3286 

6. Moschetti, E., Palacio, M.G., Picco, M., Bustos, O.H., Frery, A.C.: On the use of 
Lee's protocol for speckle-reducing techniques. Latin American Applied Research 
36(2), 115-121 (Apr 2006) 

7. Nagao, M., Matsuyama, T.: Edge preserving smoothing. Computer Graphics and 
Image Processing 9(4), 394-407 (Apr 1979) 

8. Nascimento, A.D.C., Cintra, R.J., Frery, A.C.: Hypothesis testing in speckled data 
with stochastic distances. IEEE Transactions on Geoscience and Remote Sensing 
48(1), 373-385 (Jan 2010) 

9. Salicru, M., Morales, D., Menendez, M.L., Pardo, L.: On the applications of 
divergence type measures in testing statistical hypotheses. Journal of Multivariate 
Analysis 21(2), 372-391 (Nov 1994) 



Speckle Reduction using Stochastic Distances 9 



10. Wang, Z., Bovik, A.C.: A universal image quality index. IEEE Signal Process. 
Letters 9(3), 81-84 (2002) 



